THE PROOF OF NON-HOMOGENEOUS Tl THEOREM VIA AVERAGING OF 

DYADIC SHIFTS 



ALEXANDER VOLBERG 



ABSTRACT. We give again a proof of non-homogeneous Tl theorem. Our proof consists of three 
main parts: a construction of a random "dyadic" lattice as in (6), (7); an estimate of matrix co- 
efficients of a Calderon-Zygmund operator with respect to random Haar basis if a smaller Haar 
■ support is good like in 0; a clever averaging trick from J2], Gl which uses the averaging over 

dyadic lattices to decompose operator into dyadic shifts eliminating the error term that was present 
^ ! in (6), Q. 
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Consider a positive finite measure a supported on a compact set E lying in \Qq, the central 
^ \ cube of the unit cube <2o of R . We recall a construction of probability space of dyadic lattices 

made in (7). Suppose f^y = 2$n(Qo) denotes the dyadic grid of squares of size 2~ N in Qq. We 
continue this grid to the whole R^. Now for each such Q there is 1 d choices of its father. As soon 
as one father is chosen, all others are fixed as parallel translation of it. Consider all choices of 
grids of fathers as equally probable. Now having one grid of fathers fixed, consider 2 d choices of 
their fathers. Choose one independently of the previous choice and again let all choices be equally 
C • ' probable. Continuing like that we build Djy-i (co), Da?_2(&)), ... In the natural probability space of 

lattices (£Iq, S?q) just built, consider a subset £1 of lattices such that a cube QG^o contains ^go- 
O ■ Consider & = J^° a ^ In and the probability space (fl, 3?) is what we will be using now. 

In what follows all 2> = S>{co) = L> k < N 3> k are from (fl, &>). So let Q be in such a Si and let Q u 
^ ; i = 1 , . . . , 2 d are its children. For any f € L l (ju) we denote S'q/ = ^ lg, 



4 := S Q f ', 

QeS> k 

and 



£ ; 4/=(4 + i-4)/,A e /:=A t /-l 2 ,eG^. 
Now let / G Lq(h) subscript meaning that / f dpi = 0. Then 

/= I A e /, 

QeS> 

and 

i=l 

where /ig are called — Haar functions and the have the following properties 

• ll*' e IU = 1 ' 

• « 2 = I m= l CQ,mlQ,„, 
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• \c Q ,m\ < l/y/n(Qm)- 

Above Q m 's are children of Q. 

Definition 1. Cube Q € &{(o) is called good ( (r, y)-good) if for any R in the same S>((o) but such 
that t(R) > 2 r £(Q) one has 

(1.1) dist(Q, sk(R)) > £(R) l ~ r l(Q) r , 

where sk(R) :={J 2 *-?dQ m , again Q m 's being all children of Q. 

Given QGft we denote g{Q) = k. Our main "tool" is going to be the famous "dyadic shifts". 
But they will be with respect to non-homogenous measure. Their typical building blocks will be 
Haar projections with respect to non-homogeneous measure fi. This is the only slight difference 
of this note with (21, (3J, 0. 

Definition 2. Precisely, we call by S m ,„ (shift of complexity (m,n), or shift of complexity max(m,n)) 
the operator given by the kernel 



/-)■ £ / a L (x,y)f{y)dn{y), 



where 



(1-2) a L (x,y)= £ c L ^ R h' Q (x)h J R (y) , 

Q<ZL,R(ZL 
g(Q)=g(L)+m,g(R)=g(L)+n 

where h'g := hg', h J R -=h R ,J are Haar functions (as above) orthogonal and normalized in L 2 (djx), 
and \c^q^r \ are such that 

(i-3) £|cL, e ,«| 2 < i. 

Q-R 

Often we will skip superscripts One will always skip superscript pt. 



Remark 1. In particular, it is easy to see that if a^ has form (11.21 ) and satisfies 

(1-4) M^)l<^, 

then (11.31 ) is automatically satisfied, and we are dealing with dyadic shift. 

Remark 2. A little bit different but basically equivalent definition can be done like that: operator 
sending A^(L 2 ( i u)) to itself and having the kernel ai(x,y) satisfying estimate (11.4b is called a local 
dyadic shift of order n. Here A£(L 2 (/i)) denotes the space of L 2 (/i) functions supported on L and 
having constant values on children Q of L such that g{Q) = g(L) + n + \. Now dyadic shift of 
order n is an operator of the form § n f := Y*LeSi lL a L{ x ^y)f(y)dy, where a^ corresponds to local 
shift of order n. 

All these definitions bring us operators satisfying obviously 

(!-6) \\^n\\ L 2 (fi)^L 2 (n) <« + l) 

We also need generalized shifts, but only of complexity (0, 1). 



Definition 3. LetYlf := Y,LeS>{f)L ijVW) T,icL,\e\=2- s \L\ c l,i - h\, where {cli} satisfy not just the 
condition H-{L)Y<£<zL,\l\=2- s \L\ |q,l| 2 — 1 tnat would be required for the usual (0,s)-shift normal- 
ization condition, but a rather stronger Carleson condition 

(1.7) £ H(L) £ \c e , L \ 2 <ii(R). 

LcR,LeSf £<ZL,\1\=2- S \L\ 
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Then IT is called a generalized shift of complexity (0,s). 

Remark 3. Notice that here h J L is a /x-Haar function. But we already skipped n and we will be 
skipping j as well. In fact, a morally same definition would be 



Les> ecL,\e\=2- s \L\ 

where clj satisfy (I1.7I ). Notice that all these Yl 's are variations of familiar paraproduct operators. 
They are of course also bounded in L 2 (£i) by a universal constant (actually by constant 2). 

2. Main Theorem 

We are in a position to formulate our main results. Recall when operator T is called an operator 
with Calderon-Zygmund kernel of order m. 

Definition 4. Let X be a geometrically doubling metric space. 

Let X(x,r) be a positive function, increasing and doubling in r, i.e. X(x,2r) ^ CX(x,r), where 
C does not depend on x and r. 

We call K(x,y) : XxX->la Calderon-Zygmund kernel, associated to a function X, if 

(2.1) \K(x,y)\ <min ' 



X(x, \xy\Y X(y, \xy\) J ' 



|xx" e 



(2.2) \K(x,y)-K(x',y)\ ^ \xy\^C\xx'\, 

(2.3) \K { x,y)-K { x,y')H w ^^ y \xy\>C\yy>\. 

By B(x, r) we denote the ball in \ .\ metric, i.e., B(x, r) = {y € X : \yx\ < r}. 

Let jx be a measure on X, such that fx (B(x, r) ) ^ CX (x, r), where C does not depend on x and r. 
We say that T is an operator with Calderon-Zygmund kernel K on our metric space X if 

(2.4) Tf{x) = J K(x,y)f(y)dn(y), V* g sup Pj u, V/ € Cq . 

We call T a Calderon-Zygmund operator, if it is an operator with Calderon-Zygmund kernel, and 
on the top of that 

T is bounded L 2 (/l) — > L 2 (}x) . 

Let us notice that M. d is of course a geometrically doubling metric space with the usual euclidean 
metric. Specify the above definition to 

X(x,r)=r' n , \xy\ = \x — y\, 

then we have a Calderon-Zygmund kernel of order m in W l and Calderon-Zygmund opera- 
tors of order m in W 1 . Measure /I satisfying n(B(x,r)) < Cr" 1 (or more generally jj.(B(x,r)) ^ 
CX (x, r)) but not satisfying any estimate from below is called non-homogeneous. The correspond- 
ing Calderon-Zygmund operator then is also called non-homogeneous. 

Theorem 2.1. Let jx(B(x,r)) < r 111 . Let T be a Calderon-Zygmund operator of order m in M. d . 
Then there exists a probability space of dyadic lattices (£2, ^) such that 

(2.5) T = c lT I n((o)d^((o) + c 2T [ n»^(ffl) + c 3r f'2^ / S n ((o) d ^ (co) . 

Ja Ja ' Ja 

Moreover, Ej > 0. Constants C\j 1 C2j,c^j depend on the Calderon-Zygmund parameters of the 
kernel, on m and d, and on the best constant in the so-called T 1 conditions: 

(2-6) \\TIq\\ 2 2^<CoH(Q), 
(2-7) \\T*l Q \\l^<CoH(Q), 
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Remark 4. The same thing holds on general geometrically doubling metric space X ( not just M. d ) 
and any non-homogeneous Calderdn—Zygmund operator having Calderdn—Zygmund kernel in the 
generalized sense above. Of course measure should satisfy 

H(B(x,r)) s^CA(x,r). 

We prefer to prove the K -version just for the sake of avoiding some slight technicalities. For 
example, the construction of the suitable probability space of random dyadic lattice on X is a bit 
more involved than such construction in M. d . See JH. 

From this we obtain immediately the non-homogeneous Tl theorem: 

Corollary 2.2. Let operator T be a Calderdn—Zygmund operator of order m. Let \lbe a measure 
of growth at most m: }x(B(x,r)) < r m . Then \^T\i.\i < C(Co,d,m,e), where £ is from Definition® 
and Co is from Tl assumptions (12.61) . (12.71) . 



Remark 5. This corollary (Tl theorem) has a long story, if fx = mj it was proved by David— 
Journe. For homogeneous (doubling) measures jX it was proved by Christ HI. In the case of 
non-homogeneous fj,, the proof is slightly involved, it has a relatively short exposition in Q. Here 
we follow the lines of Q to prove the decomposition to random dyadic shift Theorem \2.1\ Just 
Q is not quite enough for that goal, and we use a beautiful step ofHytonen as well. Then non- 
homogeneous T 1 theorem is just a corollary of the decomposition result, because all shifts of order 
n involved in (12.51 ) have norms at most n+l (see the discussion above), but decomposition (12.5b 
has an exponentially decreasing factor. 



3. Proof of d23j) 
Above good cubes were introduced. The cube is bad if it is not good. 
Lemma 3.1. @>{Qis bad} < Ci2" C2r . 

Proof. By the construction this probability gets estimated by the sum of volumes of 2~ rs , s > r, 
neighborhoods of the boundary of a unit cube (the reader can easily understand why the unit cube 
by using the scaling invariance). □ 

Lemma 3.2. By a small change in probability space we can think that &{Qis good} is indepen- 
dent of Q. 

This would be obvious if we would not pass from £2o to Q., no change would be needed. Other- 
wise, this is not quite true without the change, but take a cube Q{(o). We already know that if r is 
large and fixed 

(3.1) ^{gis good} > 1 -Ci2-' 2 '" =-PQ>\- 

Take a random variable £q((Q ), which is equally distributed on [0, 1]. We know that 

P(<2is good)=p e . 

We call Q "really good" if 

fee[0,^]. 

Otherwise Q joins bad cubes. Then 

P(<2 is really good) = i 

and we are done. 

Remark 6. We do not want to use "really good" expression below. But in fact everywhere below 
when we write "good" we mean "really good" in the above sense. We need this only to have the 
probability of being good the same for all cubes. 
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Let f,g 6 Lq(h), having constant value on each cube from 3> N . We can write 



Q j R i 



First, we state and proof the theorem, that says that essential part of bilinear form of T can 
be expressed in terms of pair of cubes, where the smallest one is good. This is almost what has 
been done in 0. The difference is that in Q an error term (very small) appeared. To eliminate 
the error term we follow the idea of Hytonen |0. In fact, the work Q improved on "good-bad" 
decomposition of |6j, Q, (H by replacing inequalities by an equality and getting rid of the error 
term. 

Theorem 3.3. Let T be any linear operator. Then the following equality holds: 

\s £ (Th j Q ,h i R )(f,h j Q )(g,ii i R ) = <? £ {Th> Q tf R )(j,h> Q )ig,it R ). 

Q,R,iJ Q,R,Lj 

t(Q)>t(R) £(Q)^£(R),Ris good 
The same is true if we replace ^ by >. 
Proof. We denote 

g(t)= £ (r^,4)(/,^)( g ,^). 

e(Q)>£(R) 

o'(T)= £ {Tti Q M R ){f,h j Q ){g,tt R )- 

Z{Q)>l{R) 
R is good 

We would like to get a relationship between S o(T) and $ o'(T). We fix R and put 

g g ood-= £ (g,ti R )h' R . 
sis good 

Then 

£ £ {Th j Q , h R ) (f, h J Q ) (g, ti R ) = ( T(f), £ {gM R )V R J = {T(f),g good ) . 
Q r is good y r is good J 

Taking expectations, we obtain 



Q.R 

(3-2) ! t 

{T{f),S ggpod ) = -(T(f),g) = -^^(Th J Q ,h' R )(f,h J Q )(g,h' R ). 

Q,R 

Next, suppose i{Q) < £(R). Then the goodness of R does not depend on Q, and so 

l - {Tti Q ,H R ){frf Q ){g,tt R ) = ((r^,4)(/,4)(g,4)l« is goodies) • 

Let us explain this equality. The right hand side is conditioned: meaning that the left hand side 
involves the fraction of two numbers: 1) the number of all lattices containing Q,R in it and such 
that 7? (the one that is larger by size) is good and 2) the number of lattices containing Q,R in it. 
This fraction is exactly % goo d = \- The equality has been explained. 

Now we fix a pair of Q,R, £(Q) < £(R), and multiply both sides by the probability that this pair 
is in the same dyadic lattice from our family. This probability is just the ratio of the number of 
dyadic lattices in our family containing elements Q and 7? to the number of all dyadic lattices in 
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our family. After multiplication by this ratio and the summation of all terms with £(Q) < 1{R) we 
get finally, 

(3.3) U £ {Thyi i R ){f,ti Q )(g,h i R ) = £ £ {Th } Q M R )(f,ti Q ){g 

i^R)lsis good • 

i{Q)<£{R) £(Q)<£(R) 
Now we use first (13.21 ) and then (13.31 ): 

^£(r^,^)(/,^)( g) 4) = ^£(r4,4)(/,^)(g,^)i Ris d = 

L Q,R Q,R 

^«)l#is good 

£(Q)<e(R) £(Q)*>£(R) 

X -S £ (Th^ R )(f,h{ 2 )(g,h i R ) + ^ £ {Th! Q ,it R )(fJ Q ){g,ft R ), 

£{Q)<£{R) e(Q)^e(R),Ris good 

and therefore 

(3-4) ^ £ (r^,4)(/ ) 4)( g) 4) = ^ £ (Th^h R )(f,h J Q )(g,h R ), 

e(Q)^£(R),Ris good 1{Q>£{ R ) 
which is the statement of our Theorem. □ 



Now we skip i, j for the sake of brevity. We have just reduced the estimate of the bilinear form 
£g Se ^(r/2g,/iR)(/,/jQ)(g,/jR) to the estimate over all dyadic lattices in our family, but summing 
over pairs Q,R, where the smaller in size is always good: £ Lg,Re^, smaller is good(ThQ,h R )(f, Hq). 
Split it to two "triangular" sums: £ZQfe@4(R)<e(Q),Ris S ood( Th QM)(f> h Q)(8,hR) and 

S £ (Th Q ,h R )(f,h Q )(g,h R ). 

Q,Re@,£(Q)<e{R),Q is good 

They are basically symmetric, so we will work only with the second sum. 

First consider Oq := $ Lqg^, gis good(T^2^e)(/>^e)C?>^e)- We do not care where Q is good 
or not and estimate the coefficient {ThQ,ho) in the most simple way. Recall that Kq = L/=i c QJ^Qj^ 
where Qj are children of Q. We also remember that \cqj\ < 1/ ^[i(Qj). Estimating 

\cQj\\cQ,f\\(T\ Qp \Q f )\ < \/J^)\/J^)ClsqW{Qi)Jl^) <Cg 

by (12.6I ). we can conclude that Oo/Cq is actually splits to at most A d shifts of order 0. 

Similarly we can can work with o s = S 'L Q ,R £ ^,QcR,e(Q)=2-'e(R),Qis goo d(ThQ,h R )(f,h Q )(g,h R ) 
for s = 1, . . . , r — 1. We need r to be large, but not too much, it depends on d only, and is chosen 
in(l3~T1). 

3.1. Decomposition of the inner sum. Now we start to work with a s ,s > r. Fix a pair Q,R, and 
let R\ be a descendant of R such that 1{R\) = 2 r £(Q). Consider the son R2 of R\ that contains Q. 
We know that Q is good, in particular, 

dist(£,dtf 2 ) > dist(fi,j*(/?i)) > €(/Ji) 1_ ^(g) r = ^(/?i)2-' T = 2 -2-' T £(/? 2 ) . 
Number 7 will be a small one so 

(3.5) dist(£, dR 2 ) > 2- r £{R { ) = £(Q) . 

We want to estimate (77?g,/ifi). 
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Lemma 3.4. Let Q C R, S{R) be the son ofR containing Q, and let dist(<2, dS{R)) > £(Q). Let T 
be a Calderdn—Zygmund operator with parameter £ in ((U). Then 



(Th Q ,h R ) = (h R } s{R) (h Q ,A Q T*l)^+t Q , R ., 



where 



m 

IqJr\S(r) dist(t,Q) + £(Q)) 
£(n) e 

ti*(t,Q)+t(Q))"*° l*A « 1 1 <M S( ^ I dH (s) | dn (t) . 
Proof. We write h R = h R ■ l R \ s{R) + ^ ls(R)- Then we continue by 

h R = (hR) sm 1 +h R ■ l R \ s{R) - (h R ) s{R)4l (1 - 1 S(R) ) • 
Then (denoting by cq the center of Q) we write 

, [K(x,y) - K(x, c Q )}h Q (y) (M sfRl djU (?) W • 

W>\S{R)JQ ■H«J>M 

Now the usual Calderon-Zygmund estimate of the kernel finishes the lemma. In this estimate we 
used ([331). □ 



After proving this lemma let us consider two integral terms above separately t\ := t\ : Q jR := 

Ir\S(R) ■■■ and ? 2 ■ = t2,Q,R ■= J K d\ R .... 

In the second integral we estimate /jq in L l (}i): < •y/ H(Q), and we estimate /is in 

L°°(jU): H/irII^o < l/y/n(S(R)). Integral itself is at most (recall that ll{B{x,r) < r" 1 )) 

(3 ' 6) Lo\S(R) (&St(LQ)+£(Q))>"+e MW " 



(dist(f,£)+£(<2)) m + e ^ w " dist(2,^(/?)) e " 
So if 2 is good, meaning that dist(g, > ^(/?) 1_r £(G) r then <f3J>]> gives us 

(3 - 7) " 1^)) J /(Sp*' 

In the first integral we estimate /?g in L l {pL): H/zgHi,^ < \J fl(Q), and we cannot estimate h R in 
L°°(ju): ||l°°(r\s(«V The problem is that this supremum is bounded by 1/ y/fi(s(R)) for a sibling 
of S(R). But because doubling is missing this can be an uncontrollably bad estimate. The 

term (^j§y) is a good term , at least it is bounded by 1, on the other hand the term (^§y) 



is not bounded by anything, it is uncontrollable. Therefore, we estimate here \\h R \\i „ < \J }i{R). 
Integral itself we are forced to estimate in L°° as L Y {}i) has been just spent. So we get the term 

dist(Q,sk(R)) m+e • 

So if Q is good, meaning that dist(Q,sk(R)) > £{R) x -^l{QY then gives us 

1/2 £{Q) e 
\h,Q,R\ < (H{Q)H(R)) ^ R y n+ £-( m+e )y^Qy m+ £)y • 

Choose 7 := 2(m £ +e) . Then we get 



(3.8) |^|<(__j __ <{j^) ■ 

We again used that fi(B(x,r) < r m ) in the last inequality. Compare now ( 13.71 ) and (13 - 8b - We see 
that for small y (and our y is small) e/2 < 1 — ye, and we can conclude that estimate (13.81 ) holds 
for both terms ?i,g,R,f2,g,fi- 
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Now notice that sums £ GC fi,f(e)=2-'--**(K) ) Gis g ood{ Th QM){fih Q ), (g,h R ), k>0, can be written 
as three sums: 

£ £ tu QiR (f, h Q),{g, h R), 

R QcR,£(Q)=2- r - k £(R),Qk good 

£ L h,Q,R(f,h Q ),(g,h R ), 

R QcR,£(Q)=2- r - k e(R),Qis good 

and 

I I (hR) s(R) {h Q ,A Q r'l) ll (f,hQ)(g,h R )- 

R QcR,£(Q)=2- r - k e(R),Qm good 

Obviously, the first sum is the bilinear form of a shift of complexity (0,r + k) having the coef- 

£(r+A-) . . 

ficient 2 in front, just look at (13- 8b - The second sum is also the bilinear form of a shift of 

(l-cy)(r+fr) 

complexity (0,r + fc) having the coefficient 2 2 in front. We just look at (13.71 ) and notice 

thatI ec5(s) ((At(0/At(5(/?)) 1/2 ) 2 <l. 

This is exactly what we need, and the part of £ Y,£(Q)<£(R),Qis good • • • , which is given by Q C R 
is represented as the sum of shifts of complexity (0,n), n > r, with exponential coefficients of the 
form 2~ Sn , n > 0. However, this is done up to the third sum. 

We cannot take care of the third sum individually. Instead we sum the third sums in all k > 
and all Uq , i = 1 , . . . , 2 — 1 , hg , j = 1 , . . . , 2 — 1 (we recall that the index / was omitted, now we 
remember it). After summing over k and i,j we get (F(L) denotes the dyadic father of L) 

l' I (sM m W F[L) ) L41 £ (A G /,A Q (ri)). 

i'=l LeSl QcL,£{Q)<2-( r -')e(L),Qis good 

We introduce the following operator: 

*(*) := 2 L I MW^Wm I A e(^i)- 

1=1 LeS> QGL,e{Q)<2-('- l >f,(L),Qisgood 

This is the same as 

71(g) = £ (A P{L) g) L ^ £ Aq(T*\) . 

LE%> QcL,f(Q)<2-('- 1 )^(L),Qisgood 

We can rewrite this formula by summing telescopically first over L6 f such that i(L) > 
2( r ~ l H(Q). Then we get (we assume J f d\l = for simplicity) 

*(*)=E(fl* I A G (ri). 

ReS? G cff,^(e)=2-(''- 1 )£(R), G isgood 

We check now by inspection the following equality 

I'l I {h i R ) m (h i Q A Q riUfM Q ){g:h R ) = 

\J-y) i,j=lk>0QcR,e(Q)=2-'- k £(R),Qisgood 

(f,JC(g)) = (**f,g). 

Operator n(g) is a (0,r — 1) generalized dyadic shift, it is just 

J=l Re3> G ci?,£(e)=2~('- 1 )£(R), G isgood 

To see that we need just to check the carleson condition. We fix L € & and we can see by (13.51) 
that the estimate 

(3.10) £ l|A G (ri)||J <Cfl(L). 

QCL,£(Q) <2-( r - 1 )£(L) , QndL=$ 
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is enough. 

To prove (13.101 ) we need 

Lemma 3.5. Let T be Calderdn—Zygmund operator satisfying (I2.7I ). Let S be any dyadic square, 
and let S 1 = LIS. Then 

£ \\A Q T*\\\l<C^(S'), 

Q<zS 

where Ci = C(Co,d,e) with Co from (12.7I ). 

Proof We write A Q T*l = A Q T*l s , + A Q T*(l - ly). The first term is easy: 

£ \\A Q T*\ s ,\\l<\\T*\ s ,\\l<Coli(S') 

just by d277b - 



To estimate £ ecS ||A G r*(l - ly)|| 2 we fix / € L 2 ^), ||/|| M = 1, we write 

(/,A G r (1 - ly) = (rA /, 1 - ly) = / / \K(x,y) - K(x,c Q )}A Q f(y)dn(y)dil(x 

JW\s' Jo L 



Such integral we already saw in (I3.6t because we can estimate it by ||Aq/|| i ^ < ||Ag/|| i ^ \/n(Q) < 
multiplied by f Rd \ s , (dist(X; g ) ff (6))m+£ < «(<2)7^) e - As / was arbitrary / G 

L 2 (m),||/|| p = 1, we get that 

||A G r(i-iy)g<c M ((2)(^) 2e . 

Taking the sum over Q we obtain 

£ l|A e r(i-iy)ll^<l|riy|| 2 <CoM(5'). 

Lemma is proved. □ 
We proved (13.101 ). but it also proves that % is a bounded generalized shift. 

3.2. The decomposition of the outer sum. We are left to decompose 

£ (Th Q ,h R )(f,h Q ,g,h R ) 

Qnfi=0,£(2)<£(R),2isgood 

into the bilinear form of (j, f )-shifts with exponentially small in max(s,f) coefficients. 
Denote 

D(Q,R):=£(Q)+dist(Q,R)+£(R). 

Also let L(Q,R) be adyadic interval from the same lattice such that £(L(Q,R) £ 2D(Q,R),4D(Q,R)) 
that contains 7?. 

Exactly as we did this before we can estimate 

{Th Q ,h R )= [ [ [K(x,y)-K(x,c Q )}h Q (y)h R (x)dn(y)dn(x) 

JRJQ 

by estimating \\h Q \\^ < Vju(G), INkj* < Vj^W. and J^iy^ < £(Q) e ' 2 / £(Rfl 2+m if dist(fi,tf) < 
£(R). Otherwise the estimate is £(Q) e / D(Q,R)"' +£ . These two estimates are both united into the 
following one obviously 

£(Q)y/2 £{Rfl 2 



(3.1D \ {THqM) \< C {^l) D ^ R)m+e/2 v^)vm- 

Of course in this estimate we used not only that Q is good, but also that £(Q) < 2~ r £(R). Only 
having this latter condition we can apply the estimate on dist(<2,7?) from Definition Q] that was 
used in getting (13- lib - 
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However, if £(Q) £ [2 r £(R),£(R) we use just a trivial estimate of coefficient (ThQ,h R )\, 
namely 

(3.12) \(Th Q ,h R )\<C(C Q ,d), 

where Co is from (12.61 . This is not dangerous at all because such pairs Q,R will be able to form 
below only shifts of complexity (s,t), where < s < t < r; the number of such shifts is at most 

2 , and let us recall, that r is not a large number, it depends only on d (see (13. lb ). 

Now in a given ^Efia pair of Q,R may or may not be inside L(Q,R) (R C L(Q,R) by 
definition). But the ration of nice lattices (these are those when both Q,R are inside L(Q,R)) 
with respect to all lattices in which both Q,R are present is bounded away from zero, this ration 
(probability) satisfies 

(3.13) P (Q,R)>P d >0. 
We want to modify the following expectation 

£:=^ I (Th Q ,h R )(f,h Q ,g,h R ). 

QnR=®,e(Q)<e(R),Qis good 

This expectation is really a certain sum itself, namely the sum over all lattices in Q. divided by 
)J(Q) =: M. Each time Q,R are not in a nice lattice we put zero in front of corresponding term. 
This changes very much the sum. However we can make up for that, and we can leave the sum 
unchanged if for nice lattices we put the coefficient \/p(Q,R) in front of corresponding terms 
(and keep otherwise). 
Then 

number of lattices containing Q,R 1 number of nice lattices containing Q, R 
M = p(Q,R) M ' 

Notice that the original sum £ terms Q,R multiplied by the LHS. The modified sum will contain 
the same terms multiplied by the RHS. So it is not modified at all, it is the same sum exactly! We 
can write it again as 

S £ m(Q,R,(o)(Th Q ,h R )(f,h Q ,g,h R ), 

QnR=®,e(Q)<i{R),Qis good 

where the random coefficients m(Q,R, omega) are either (if the lattice S = ft) is not nice), or 
l/p(Q,R) if the lattice is nice. 

Now let us fix two positive integers s > t, fix a lattice, and consider this latter sum only for this 
lattice, and write it as 

III £ m(P,Q)(ThQ,h R )(f,h Q )(g,h R ) =:EX>*. 

s t L Q<zL,R<zL,e(Q)=2-'l(L),l(R)=2- s l(L) s ' 

Each G st is a dyadic shift of complexity (s,t). In fact, use (13.131) and (13.1 11 ) and easily see 
that the sum of squares of coefficients inside each L is bounded (we use again n(B(x,r) < r m ). 

Moreover, the terms {j^j > f d(qr) ) ^ rom EED gives us the desired exponentially small 
coefficient whose size is at most 2-^-') I 2 ■ 2- £ V 2 = 2- £t l 2 = 2- £max ( 4 V)/2. 
Theorem 12. 1 l is completely proved. 
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